Speaker consistency in the realization of prosodic prominence in the Boston University Radio Speech Corpus
Abstract
This paper analyzes inter-speaker consistency in how multiple speakers realize prosodic events when they read the same scripts. The analysis uses the Boston University Radio Speech Corpus (BURSC), which contains data from five speakers (three female and two male), each reading the same scripts comprising more than 110 different sentences. This design makes the corpus a useful basis for measuring the degree of speaker variation or speaker consistency in prosodic realization. Inter-speaker consistency in the rendition of prosodic prominence is compared pairwise. The results indicate that the average rate of consistency on the presence or absence of a pitch accent is 89.81%, and the average rate of consistency on the type of pitch accent is 72.17%. The findings imply that prosodic prominence placement in an utterance is subject to a constraint shared across speakers, while speakers still vary to a certain degree in how they render prosodic prominence.
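The abstract does not spell out how the pairwise rates are computed, so the following is only a minimal sketch of one plausible reading: for each pair of speakers, count the fraction of aligned words on which their labels match, then average over all pairs (five speakers give ten pairs). The function name, the speaker keys, and the toy labels are hypothetical and are not taken from the paper or the corpus.

```python
from itertools import combinations


def pairwise_agreement(labels_by_speaker):
    """Mean fraction of aligned positions on which each speaker pair agrees.

    labels_by_speaker maps a speaker name to a list of labels, one per word,
    aligned across speakers (an assumption: all speakers read the same words).
    """
    rates = []
    for a, b in combinations(sorted(labels_by_speaker), 2):
        seq_a, seq_b = labels_by_speaker[a], labels_by_speaker[b]
        if len(seq_a) != len(seq_b) or not seq_a:
            raise ValueError("label sequences must be non-empty and aligned")
        matches = sum(x == y for x, y in zip(seq_a, seq_b))
        rates.append(matches / len(seq_a))
    if not rates:
        raise ValueError("need at least two speakers")
    return sum(rates) / len(rates)


# Hypothetical toy data: 1 = pitch accent present, 0 = absent.
toy_presence = {
    "speaker_a": [1, 0, 1, 0],
    "speaker_b": [1, 0, 0, 0],
    "speaker_c": [1, 1, 1, 0],
}

if __name__ == "__main__":
    print(f"{pairwise_agreement(toy_presence):.4f}")
```

The same function applies unchanged to accent-type consistency if the lists hold type labels instead of 0/1 flags, since it only tests labels for equality.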
Similar resources
Simultaneous recognition of words and prosody in the Boston University Radio Speech Corpus
This paper describes automatic speech recognition systems that satisfy two technological objectives. First, we seek to improve the automatic labeling of prosody, in order to aid future research in automatic speech understanding. Second, we seek to apply statistical speech recognition models of prosody for the purpose of reducing the word error rate of an automatic speech recognizer. The systems...
Syllable-level prominence detection with acoustic evidence
Accurate prominence annotation benefits many spoken language understanding tasks as well as speech synthesis. In this work, we conduct a thorough study using acoustic prosodic cues for prominence detection in speech. This study is different from previous work in several aspects. In addition to the widely used prosodic features, such as pitch, energy, and duration, we introduce the use of cepstr...
Speaker variation in English prosodic boundary*
Yoon, Tae-Jin. 2014. Speaker variation in English prosodic boundary. Linguistic Research 31(1), 1-23. This paper analyses the rate of inter-speaker consistency in the way multiple speakers render prosodic events when they read the same scripts. Prosodically labeled data of five speakers from the Boston Radio Speech Corpus (BURSC) are used to measure the degree of speaker variation in rendering ...
The Prosody of Discourse Structure and Content in the Production of Persian EFL Learners
The present research addressed the prosodic realization of global and local text structure and content in the spoken discourse data produced by Persian EFL learners. Two newspaper articles were analyzed using Rhetorical Structure Theory. Based on these analyses, the global structure in terms of hierarchical level, the local structure in terms of the relative importance of text segments and the ...